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Memo 16--A Question-Answering Routine ^ 

by 

Anthony Valiant Phillips 

A program has been written in the LISP programming language 
to answer English-language questions by consulting an English- 
language text. The program can handle questions about the sub¬ 
ject, verb, place and time of simple sentences. The program pro¬ 
ceeds in two steps. In the first, the machine analyzes the cues- 
tion and the sentences of the text, and puts them into a form in 
which they can be compared. For this analysis the machine must 
have as input a dictionary of part-of-speech tag3, and a set of 
rules, analogous to phrase-structure rules, according to which it 
will organize the sentences.; This analysis organizes the sentences 
into noun-phrases, verbs, and prepositional phrases. The machine 
then picks from the sentience a subject, a verb, an object, and pre¬ 
positional phrases relating to place and time. This 13 the "canon¬ 
ical form" of the, sentence* 

The next part of the program compares the question with each 
of the sentences In the text. Those that match, i.e. contain the 
information the question is,asking for, are stored and the answer 
is made up from them. If none are found, an appropriate negative 
answer is given. 

This routine has been debugged and has run successfully. 
Introduction 

The original aim of this research was a routine that could 
pass a reading-comprehension text suitable for six or seven-year- 
old children, and even though we had to lower our sights, this 
still gives the best idea of the purpose of this paper. By £he 
time a child has learnt to read, he is already capable of feats 
of data analysis that seem very difficult, if not impossible, to 
duplicate on a machine. The child has picked up the key to 
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a^d Development Command) and the U.S. Navy (Office of Naval Re¬ 
search). In addition, this work was done in part at the MIT 
Computation Center, Cambridge, Massachusetts. 
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synfcactic analysis, and has begun to assimilate the huge mass' of 
common knowledge that is taken for granted in human communica¬ 
tion. He is also becoming initiated, although slowly*, to the 
habit of reasohing, and is learning to handle logical connectives 

and quantifiers. These are all directions this paper did no; ex¬ 
plore. ) 

> 

' The idea behind the answering process that will be described 
in the following pages is very simple. We. assume that each ques¬ 
tion asks for a piece of information, and that pieces of infor¬ 
mation are contained in the text, one to a sentence. The answer¬ 
ing routine consists then in trying to match the question against 
successive sentences of the text to see if any one of them con¬ 
tains the information asked for. 

• f 

The routine first performs an analysis of- the question and 
of each sentence of the text, and edits them into a form in which 
they can be compared. It then performs the comparison, and prints 
out appropriate answers. The following examples are taken from 
actual machine runs, not using the three functions mentioned above. 

Examples: text: ((AT SCHOOL JOHNNY MEETS THE TEACHER) 

(THE TEACHER READS BOOKS IN THE CLASSROOM)) 

question: (WHERE DOES THE TEACHER READ BOOKS) 

answer: (((IN THE CLASSROOM) (THE TEACHER READS BOOKS IN THE 

•CLASSROOM))) . 

♦ . 

text: (JOHNNY GOES TO SCHOOL IN THE MORNING) 

9 

question: (WHOM DOES JOHNNY MEET) 

answer: (THE ORACLE DOES-NOT KNOW) 

Additional input is required for the sentence analysis: in • 
the examples above, the machine had also been fed three lists. 
(These are part of the "a-list".) The first, denoted by GRAMM1, 
was a part-of-speech "dictionary" of the form ((M THE) (N TEACHER) 
(V READS) (N BOOKS) (P IN) (NPL CLASSROOM) (P AT) (N JOHNNY) 

(V MEETS) (V GOES) (P TO) (NTI MORNING) (NPL SCHOOL) (AUXl DOES) 
(AUX1 DO) (V READ) (V SIT) (Q-WHERE) (Q WHAT) (Q WHOM) (N MARY)). 


See Piaget and Inhelder, La Genese des Structures Logiques 
, Elementaires , Delachaux eF"Niestle, lJeuchateI,’"±959. 
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The second list, GRAMM2, is a set of "grammar rules" which the 
machine uses to analyze the sentences and the question. For the 
examples above, the GRAMM2 list was (((M NPL) NPL) ((M N) N) 

((P NPL) PNPL) ((P NTI) PNTI) ((M NTl) NTl) ((p NTl) PNTl)). The 
third list, referred to as Z. is of the form (((DOES GO) GOES) 
((DOES DO) DOES) ((DOES READ) READS)). These three lists, the 
sentence and the text are processed by the program described in 
the following pages, and printed out in A 

Work of a similar nature is being done at the Lincoln 
Laboratory of MIT in B. F. Green’s group. They are interested 
in answering questions pertaining to a table, in this case to the 
table of 1958 baseball scores. Their work should be nearly 
finished, but up to now little information has been released 
about the details of their program. (See the Lincoln Laboratory 
QPR for Division 5> December 15> 1959*) I believe that they have 
had to analyze special segments of the syntax more thoroughly 
("exactly three"more than", etc.) becuase of the nature of 
their text and the questions expected about it. 

Our exposition will proceed in the following manner: first, 
to explain how our answering process works, we shall describe a 
simple example of a question-answering routine, answerl. This 
routine was written at the beginning of this study, and was de¬ 
bugged and will run successfully as it is presented here. It 
answers simple-minded questions about a table, but its basic 
mechanism is the same as that of the large program. After that 
we will present a description of the main program, followed by a 
more detailed examination of each of its subfunctions. 


A Simple Example: The program answerl 

_ 9 

answerl takes as input a question and a table. The table 
tells whether or not a relation R exists between any two letters 
of the alphabet.* The table is arranged as follows: 

(ARB BRC ARC...) 

The questions that may be asked are: 

Does ARB? 

What R B? 

What does B R? , where A, B may be any two letters. 

t 

? The program described; searches the "text" for an answer to 
the question. There is no provision for recording two or more 

. | . 1 ______—-- ■ ■ ■ — — 

It* will be seen below that S cannot be used. 
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an3v;ers if they should occur. Thus if ARC and A R D both ap¬ 
peal* in the table, the answer to the question "What does A R? " 
will be A R C" if this entry happens to come first in the 
table. The answer will be "A R Nothing” if there is no entry of 
the form A R ... 


answerl works as follows: lb first performs qanal of the 
question, translating it into a form where, it can be Easily 
matched against the table: 

V 

qanal changes "Does A R B?" into ARB 

"What R B?" into S R B 

"What does B R?" into BRR S . 


It then performs test to compare the analyzed question with the 
text, test uses testa to match the question against each suc¬ 
cessive table entry. If no matches are recorded, the subroutine 
S is performed. Which takes the question as input and supplies 
the correct negative answer: "A doe3 not R B" ^Nothing R B" 
or "3 R nothing". 


fc es fca matohes the question against an entry by using same 
to compare the *words of the question with the corresponding 
words of the table entry, same will register a match if the 
two elements it is comparing are identical, or if the question- 
element is "S", the symbol we are using for "what". This is 
expressed in the following table: 


• 4 

element of table-entry 


A B 


element of 
question A 


S 


The LISP program for answerl* 


We will describe qanal in more detail than the rest of the 
program: We want qanal to operate as follows: 


MATCH 

NO 

MATCH 

' 

MATCH 


A 

B 

C 


Is first word "does"? 


noN 

f 


Is second word n R l, ?l_ 

no> 


value is 3rd word, R, S 


yes 


value is 

2nd word, R, 4th word 


value is 
S, R, 3rd word I 


fr~ ————--— - ---- 

for a description of the LISP language see the paper by John 

McCarthy entitled "Recursive Functions of Symbolic Expressions 
and Their Computation by Machine" in Communications of the ACM, 
Vol. 3, No. 4, April, i960. ~ 
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We then express A as: 

car[x] «* quote[D0E5] ~+ list[cadr[x] ,quote[n] ,cadddr[X] ] 

B and C are expressed similarly, and we write: 

qanal[x] -[car[X] ■ quote[DOES] -♦ list[cadr[X];quote[R]; 
cadddrtx]];cadr[x] - quote[R] list[quot:>[s];quote[R]; 
caddr[x]];T “■* list[caddr[x];quote[R]:quote [s] ]] 


answerl and its other 3ubfunctlons oan be described in a 
similar manner. We want them to act according to the flowchart: 



: XJS3 definitions: 

♦ 

\ • ■ -* 

answerl[X;Y] = test[qanal[x];Y] 

te3t[X;Yj - [null[Y] — neg[x];T“+ testa[X;Y]] 

testa[X;Y] *» [same[car[X}; car[Yj ] “*■ [same[caddr[x]; 

caddr[Y] ] -*• list[car[Y];quote[R];caddr[Y]}; 
T ”*■ test[X; cdddr[Y] ] ] ;T "*■ test[X;cdddr[Y]]] 
samelX;Y] - [X - Y T;X - quote[S] -*• T;T -*• P] . 



<*•» ( 5 *“ 

neg[x] - [car[x] - quotc[s] list[quote[NOTHING];quote[R]; 

caddr[x]];caddr[x] - qnote[S] -♦ llst[car[X];quote[R]; 
quo;e[NOTHING] ]; T “*■ list [ car [ X]; quote [DOES]; quote [NOT]; 
quobe[R];caddrtx]]] 


We now'wish to apply a similar matching' procedure to an 
English-language question and text. The principal task will be 
to put the text in the form of a table, and the question in a 
form suitable for comparison with the "table" entries. This 
work will be accomplished, by the functions lex , parse , revert ;, 
edit , order which make up the "syntactic analysis" portforPof 
the'routine. The matching will be done by answer , que3 and 
anal . These are all subfunctions of the main function, oracle , 
whose arguments are a question and a text, and whose value is 
the answer or answers to the question found in the text. 

The function oracle ; 

I. The Syntactic Analysis 

It was outside the scope of this project to embark on a 
systematic and correct analysis of even a very restricted class 
of English sentences. Accordingly a number of artificial as¬ 
sumptions were made about the sentences to be analyzed. The 
most important one is that no ambiguity will be encountered at 
any level in the analysis. As we assign part-of-speech tag3 to 
the words in the analyzed sentences, this assumption means, for 
instance, that a given word can be only one part of speech. 

There is ‘no provision, in the analysis program which will be de¬ 
scribed here, for backtracking or for maintaining two or more 
distinct hypotheses about the straature of the sentence. The 
analysis proceeds in two steps. 

The first step, embodied In the functions lex and parse , 
goes through the sentence looking for nouns and grouping with 
each one the adjectival structure that belongs to it. If a noun 
is prece.ded by a .preposition, a prepositional phrase is formed. 

The sentence is presented to this analysis in the following 
form; each word is feagged with a part-of-speech symbol. Here 
the symbols used are N (noun), M (noun-modifier), V (verb), 

NPL (place-noun, e.g. playground), NTI (time-noun, e.g. noon)*, 

P (preposition), and AUXi for forms of the verb to do which 
play a privileged role. (This format is the result of applying 
lex to the sentence and the part-of-speech "dictionary".) Also 


this distinction is necessary to distinguish "in the garden" 
(answer to"where?") from "in the afternoon" (answer to "when?"). 


h 
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part of the Input la a act of rules for forming noun-phrases and 
prepositional phrases*, e.g. 

M + N *» N 

M + NTI “ NTI 

P + NTI - PNTI, • etc. 

There are also rules of the form 

V + V - V 

for grouping an auxiliary-participle combination into a single 
element. ("M + N * N" means that when a word or group of werda 
tagged M is followed by a word or group of words tagged N, the 
two groups should be consolidated into one, tagged N.) 

An example of theaaction of lex and parse : 
sentence (JOHNNY SEES THE DOG IN THE GARDEN) 

"dictionary" ((N JOHNNY) (N DOG) (NPL GARDEN) (V SEES) (? IN) 

(M THE)) 

rules (((M N)NN) ((M'NPL) NPL) ((P NPL) NPL)) 

The three inputs are shown in the format in which they are accep¬ 
table to the routine. ((M N) N) is a representation of M + N « N 
The result of the analysis is: 

((N JOHNNY) (V SEES) (N THE DOG) (PNPL IN THE GARDEN)). 


Up.to this point the analysis is in some sense independent 
of the routine since, if the ambiguity clause is respected, "dic¬ 
tionary" and grammar rules are part of the input. This concludes 
the first step. 

The second step: It was decided that the ■routine would-be 
written only to handle questions dealing with the subject, object 
place-phrase and time-phrase of a sentence. This implies im¬ 
mediately, for example, that only simple sentences will be con¬ 
sidered. The function order does tJHis editing. Its result is a 
list of five elements, subject, verb, object, place and time, in 
that order. Anything else in the sentence will be discarded; 
missing elements, except subject and verb whose presence is re¬ 
quired, have their place marked by an (S). We give an example of 
the combined action of lex , parse and order ; these three func¬ 
tions are responsible for the analysis of the sentence. 


U- 

These rules are inspired by Noam Chomsky's phrase-structure 
rules. (See his Syntactic Structures , Mouton and Co., 

1 S-Gravenhage, 1957, p. 26) 




*1 


sentence 


* 


(IN THE GARDEN THE LITTLE 


BOY GAVE THE DOG A BONE) 


dictionary ((NPL GARDEN) (N BOY) (N DOG) (N BONE) (V GAVE) 

(M A) (M THE) (M LITTLE) (P IN)) 

rules (((M N) N) ((MNPL) NPL) ((P NPL) PNPL)) 

result: ((THE LITTLE BOY) (GAVE). (THE DOG) (JN THE GARDEN) 


( 3 )) 


The routine cannot handle double objects. 


The analysis of the question uses the -same functions: lex, 
parse and order. It also needs revert to cope with the inyer.iion 
cTTaracter 1 sticof the question-sentence in English, and edit so 
take care of question-words, revert requires as part of its in¬ 
put a table showing how to combine the auxiliary t£ do with par¬ 
ticiples, e.g. 

x DOES + GIVE => GIVES 

DID + GO - WENT, etc. 



We give examples of the, application of lex , parse , revere , 
edit and order , in that order, to Interrogative sentences: 

sentences •(WHERE DID THE TEACHER GO) 

(HAS JOHNNY COME HOME) 

dictionary ((Q, WHERE)* (ATJXl DID) (M THE) (N TEACHER) (V GO) (V .A.S)^ 

(N JOHNNY) (V COME) (PNPL HOME)) V 

rules ,(((MN)N)) 

table ' (((DID GO) WENT)) 

results . ((THE TEACHER) (WENT) (S) (W)* (S)) 

((JOHNNY) (HAS COME) (S) (HOME) (S)) 


II. The Matching 


The' sentence' and the question ’under consideration have now 
been edited into a canonical form in which they^ can oe compared 
The comparison proceeds as follows: the first block in tne 
"canonical form" of the question is matched against the first i 
that of the sentence, and so forth. If all agree, question and 

sentence are said to match. 




n 


The blocks are compared as in the table below, where T appears 
at the conjunction of two entries if a match would be recorded, and 
F if not. 


*q is a part-of-speech tag for question-words; (w) in fourth 
position represents WHERE. 






Block: n of the 
sentence Is: 


Block n of the 
question is: 


(S), (A B) (C) 

(S) 

(W) 

(B) 


T 

T 

T 

F 

T 

T 

.F 

TT 

F 


For example, if block n of the question Is (THE OLD YELLOW 
DOG) and block n of the sentence 13 (-THE UGLY OLD YELLOW DOG), 
these blocks will match. If block n of the sentence were (THE 
DOG) or (THE YELLOW CANARY), there would be no match. If block 
n of the question is empty, it will match whatever block n of the 
question contains. If block n of the question is a question-word, 
(e.g. a (W) is the first block representing a "who"), it will 
match with block n pf the sentence as long as the latter block is 
not empty. 


The functions of oracle 

lex[X;Y] has as arguments a list of words, the sentence, and a 
list of pairs, the "dictionary". Its value is the list of 
the dictionary entries corresponding to the words in the 
sentence, lex has a subfunction lookup. 


llookup[X;Y] has as arguments a word and a list of pairs. Its 
value is the pair whose second element is the word in ques¬ 
tion. Its value is ERROR if there is no such pair. 

Examples: 

X « JOHNNY 

Y - ((N MARY).(N JOHNNY) (V SITS), etc. ) 
lookup[XjY3 » (N JOHNNY) 

X (JOHNNY GOES TO SCHOOL) 

Y - ((P TO) (N MARY) (V GOES) (N JOHNNY) (NPL SCHOOL)) 
lex[X;Y] - ((N JOHNNY) (V GOES) (P TO) (NPL SCHOOL)) 

LISP definitions: 

lookup[X;Y] « [null[Y] — ERROR:T — [X » cadar[Y] car[Yj; 

T “*■ lookup[X;cdr[Y]]]j 

lex[X;Y] ■» [null[X] NIL:T“* cons[lookup[car[x];Y]; 

lex[cdr[Xj;Y]]i 

parse [X;Y] has as arguments a list of pairs (the result of lex 
of the sentence) and another list of pairs, the rules. Elem¬ 
ents of this second list are of the form ((A B) C),. correspond 
ing to rules of the form A + B - C. The work in parse is done 
by the subfunction glob . 




t 
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glob[X;Y;Z;W;V]: X is a marker which tells whether or not ary 
rule has been applied in the last pass through the sentence. 
Y contains the sentence elements that have already been 
examined in the current pass; Z contains the rest of the 
sentence; W contains the rules that have nob yet been tried 
in the analysis of the first two elements of Z; V contains 
thfl>se that have been tried and have failed. The action cf 
glob is described in the flowchart below, where Z , repre¬ 
sents the first element of Z (car[z]) and Z 12 represents 
the second element of Z,. - > 


START 



X - ((M THE) (M NASTY) (M OLD) (N MAN)) 
Y i ,(((M N) N)) 

parse[X;Y] « ((N THE NASTY OLD MAN)) 


LISP definitions: 


glob[X;Y;Z;W;V] = [null[z] [nulltx] -♦ Y; T 

glob[NIL;NIL;Y;W;V]];null[W] -*■ glob[X;nconc[Y;cons[car[z]$ 

NIL] ];cdr[z] :V;NIL];list[caar[Z.j;caadr[Z]] *» caar[W] 

glob[quote[Sj;Y;cons[append[edartw];append[cdar[z];cdadr[z]]]; 

eddrtz]];W;V];T globLX;Y;Z;cdr[W];nconc[V;cons[car[W]; 

NIL]]]] 


parse[X;Y] - glob [ quote [S];X; NIL; Y; NIL] - 






ord er[X] has as lbs argument a list of pairs, the value of 

parse[lex[sentenco;dictlonary];rulesJ. Its value is a last 
of five elements: it picks out from the sentence a subject 
noun-phrase, a verb, an object noun-phrase, and prepositional 
phrases of place and time, and lists them in that order. When 
one of the last three is missing, it writes (s) in its place. 
order works through the subfunctions picks,pickv, picko, 
plckpl , plckti . All of these take the same argument as order . 

« 

• picks has 

pick s ha3 as value the first noun-phrase before the verb, i.e. 

"Tt looks in the list for an element beginning with N, NPL, 
or NTI and its value is the rest of that element. If it hits 
i the verb before having found a suitable noun-phrase, its 
value is ERROR. 


pi|kv looks in the list for an element beginning with V, or AUXi. 
its .value is the rest of the first such element it finds. If 
IS. it finds none such, its value is ERROR. 

* 

picko looks for a verb followed immediately by an element tagged 
N, NTI or NPL. If the element following the verb is so marked 
the value of picko is the rest of that element. If not, its 
value is (S). 

* 

plckpl looks for an element marked PNPL. If it finds one, 

plckpll looks for more. If more than one is found, they are 
consolidated to form the value of pickpl: if none is found, 
the value is (S). 

plck ti and picktil do exactly the same with PNTI. 

t 

Example: 

X - ((PNTI AT NOON) (N JOHNNY) (V GOES) (PNPL UP THE STAIRS) 
(PNPL TO HIS ROOK)) 

orderCx] » ((JOHNNY) (GOES) (S) (UP THE STAIRS TO HIS ROOK) 

(AT NOON)) 



ISP definitions: 


picks[xj =* [null[x] —<► ERROR;member[caar[x]jquote[ (N NPL N?l)33 

cdar[Xj; member[caar[X];quote((V AUXl)Jj ERROR; 

T picks [cdr[x] ] ] 



pickv[x] » [nullfx] “*• ERROR; member [caar[x3; quote [ (V AUXl)]j “•* 

cdar[X};T “*• piclcv[cdr[xj 3 3 


% 

i* 

* 

t 


picko[x] 


pickpl[x3 


[member[caar[x3;quote [(V AUXl]]] 
quote[ (N NPL NTI)3] — cdadr[XjjT 

T “♦ picko[cdr[x3 3 3 

1 [null[x] quote[ (s) 3;caar[x3 » 
append[cdarlXj;pickpll[cdr[x3 3 3 


[member[caadr[x3; 

quo te [ (3)3; 


quote[PNPL3 

;Tpickpl[cdr[x]33 


* 


pickpll[x3 * [null[x3 ’"*■ NIL:caar[x3 ** quote[PNPL3 append[cdar[x3; 

pickpll[cdr[x3 3 3 ;T-*■ pickpll[cdr[x3 3 3 


pickti[x3 - [null[x3 quotef (s) 3; caar[x3 « quote[PNTl] -*• 

append[cdarfx3;picktil[cdr[x333;T pickti[cdr[x3 3 3 

picktilCx} - [null[x3 -♦ NIL:caar[x3 - quote[PNTl3 — append[cdar[X3; 

picktll[cdr[x3j3;T picktil[cdr[X]33 


order[x3 - list[picks[x3;pickv[x];picko[x3;pickpl[x3;pickti[x33 ^ 

revert [x3 is a function of a list of pairs*, the result of ap¬ 
plying lex and parse to the question. Its value is a list 
of pairF*T It rearranges the "parsed" question to permit 
matching with the text: in particular it undoes the inver¬ 
sion typical-of the interrogative sentence. It has a sub¬ 
function Join which is responsible for grouping the auxi¬ 
liary and "the verb when they have been separated by the in¬ 
terrogative construction, and Join has a subfunction compare 

which deals especially with to do, for which it needs'll- 

table to tell it. DOES + HAVE » HAS; DID + GO = WENT, etc. 

' revert works as follows: (in the following diagram, 

S"Wi,W2,... ,Wn represents the sentence, and "Wi *» z? " 
means: "Is the tag on the first word or group of words (as 

formed by parse ) Z?") 


Actually a list of lists.’ We consider each list as a oair whose 
first element is that of the list, and whose second element is 
the rest of the list. 



Lw-v 

*» 0? 

1 yea. 

f 

r - 3 .—* 

3 

S 

i 

Then set 

no 



no ! 

V 

I 

i 

!S « W3, jointv;2:W‘tj, ; 
Wl,W5,...,Wn ; 


* 

m 

[Then set 




> 


1 S ® S 




\y 
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Examples: 

«•> 

X - ((Q, WHERE) (AUX1 DOES) (N THE MAN) (V GO)) (A) 

table contains ((DOES GO) GOES) 

revert[X] - ((N THE MAN) (V GOES) (Q WHERE)) 

X - ((Q WHO) (AUX1 DOES) (N THE WORK)) ( B ) 
revert[X] *=* ((Q WHO) (AUXi DOES) (N THE WORK)) 

f 

X » ((V HAS) (N THE PROFESSOR) (V GONE) (PNPL HOME)) ( C ) 
i i revert[X] « ((N THE PROFESSOR) (V HAS GONE) (PNPL HOMS)) 

'x - ((V HAS) (N JOHN) (N A LIZARD)) ( D ) 
j revert[X] - ((N JOHN) (V HAS) (N A LIZARD)) 


JointX;Y] is a function of two pairs. Its value is a list 
formed from the first element of the second pair, and 
either the second elements of both pairs or the value of 
comoare of the second elements, when it has to do with to do. 

J___r*_-_ * 


compare [X;Y.;Z] is a function whose first two arguments are rairs 
and whose third is a list of pairs of the form ((DOSS GO) GOE 


({DID MEET) MET), etc. It forms the list of the last halve 


3 




of its first two arguments and tries to match this with 
first- half of one of the elements of its third argument. Its 
value is then a pair made up from the second half of that ele 
ment, preceded by the first half of the second argument. Its 
action is simple and easy to understand in an example. ( 
value is ERROR if no match is found.) 


^Its 




< ' 

Example: 

H 

X » (auxi does) 

Y « (V GO) 

Z contains ((DOES GO) GOES) 

compare[X;Y;Z] - (V GOES) 

\ 

LISP definitions: 

revert[X] - [caar[X] - quote[Q] -*• [caadddr[x] « quote[V] ^ 

appe nd [ 1 is t [caddr[X];Join[cadr[X];cadddr[X]]]; 
cddddr[x]];T ( ^ X]jT-* [caaddr[x] - quote[V]*^ 
append[list[cadr[x];Join[car[x];caddr[x]]];^ 
cdddr[x]];T^ appendtlist[cadr[X];car[x]]; 

cddrCx]]]] 

join[X;Y] «* [car[x] « quote[AUXl] -* compare[X;Y;Z];T 

append[append[cons[car[Y];NIL];cdr[ X ] ]; 
cdr[Y]]] 

compare[X;Y;Z] - [null[Z] “♦ ERROR;append[cdr[x];cdr[Y]] - 

caar[z] -*■ append[cons[car[Y];NIL];cdar[zj]; 

T -* compare[X;Y;cdr[z]]] 

edit[x] has as argument and value a list of lists, the result 
of j; ex *- Parse and revert of the question. Its responsibility 
is to substitute for the question-words the symbol W with the 
appropriate tag. 

Example: 

x «• ((q who) (q what) (q whom) (q when) (q where)) 

edit[X) - ((N W) (N W) (N W) (PNTI W) (PNPL W)) 

LISP definition: 

edit[x] » [null(X] -+ .NIL; member [cadar[x]j quote [ (WHO ’WHAT WHOM)] 
t ' append[quote[((N W)).]; edit [cdr [x]]]; cadar[X] « 

quote[WHERE] — append[quote[((PNPL W)) ];edit[cdr[x]]] 
cadar[X] «■ quote[WHEN] append [quote [( (PNTI W))]; 
edit[cdr[x] ]]jT cons[car[X] ; edit[odr[’X] j ] ] 


match[X;Y] is a function of two lists Of lists. Jt3 value ic T 
(True) if the elements of each list in X are contained in the 
corresponding list in Y, and F (False) otherwise, match has a 
subfunction, contained which checks the individual lists cne 
against the other, a3 follows: 


contalned [X;Y] i3 a function of two lists. Its value i3 T $True) 
if X is (s), if X is (W) and Y' is not (S). or if each element 
of X appears in Y. Its value is F (False; otherwise, contained 
uses member which was defined ,to be the built-in function 
membob’ii 


Examples: 

X - (THE OLD MAN) 

Y - (THE OLD MAN IN THE MOON) 
contained[XjY] - T 

X - (THE MAN) 

Y - (the OLD woman) 

contained[X jY] ■ F 

(Note also that X - (THE OLD OLD MAN) 

Y - (THE OLD MAN) 
contained(XjY] ■ T) 

* 

X - ((.JOHNNY) (COMES) (S) (HOME) (S)) 

Y - ((JOHNNY APPLESEED) (COMES) (S) (HOME) (AT NOON)) 
match[X;Y] - .T 


X « ((JOHNNY) (COMES) (HOME)) : ' 

Y - ((MARY) (COMES) (HOME)) 
ir.atch[X;Y] «* F 

\ , 

* i 

LISP definitions: 

contained[X;Y] « [null[xj T;car[X] = quote[S] -*■ T; 

car[X] » quote[W] A^carCY] » quote[S] T; 

member[car(x];Y]Acontained[cdr[X];Y] “*■ T;T F] 

< 

* 

match[X;Y] » [null[x] "**' Tjcontained!car[X]jcar[Y]]match[cdr[x]; 

cdr(Y]] -*■ T;T -*• P] 
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an3wcr [X;Y;Z] is a function of three arguments: X is the pro¬ 
cessed question; Y is a list of processed sentences; Z ac¬ 
cumulates the sentences of Y which have matched successfully 
with the question X. Its value is an appropriate answer to 
the question, answer operates as follows: 


'i 

, i 


i k 1 


i 



anal is a subfunction of answer . 

anal]X;Z] is. a function of two arguments: the first is a list, 
the question; the second is a list <bf lists, the answers. The 
responsibility of anal is to pick out the words in the answers 
which matched.with the question-words (who, where, etc.) in the 
question. The actual searching is done by ques . 

ques [X;Z] has as its arguments two lists: X is the question, Y 
“is one of the answers, ques searches through the question 
for 'V's and its value is "the list of the elements of the 
answer that correspond to "W'^s in the question. • 

Examples: 

. * 

X - ((W) (COMES) (W)) 

Y = ((JOHNNY) (COMES)’(HOME)) 

. ques[X;Y] « ((JOHNNY)' (HOME)) 

X - ((W) (COMES) (S)) 

9 

Y « (((JOHNNY) (COMES) (HOME)) ((MARY) (COMES) (AWAY))) 
anal[X;Y] - (((JOHNNY) ((JOHNNY) (COMES) (HOME))) 

((MARY) ((MARY) (COMES) (AWAY)))) ^ 



t 
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Note that in the description below ques is also used as a 
subfunction of anal, to determine whether or not we have a 
ye3-no question. This is done by null[ques[X;quote[(B B B 
B B)]]] which ha3 value T only if there were no "W " 1 3 in X. 

LISP definitions: 

answer[XjY;Z] ■ [ null[Y] -*• [null[z] -♦ [null[ques[x; 

\quote[(B B B B B)]]] "♦ quote!(THE ORACLE SAYSNHO)]; 
T quote!( THE ORACLE DOES NOT KNOW)]]; 

T "* [null!quesfXj quoteC (B B B B B) ] ] ] "♦ 
list!quote!YES];Z];T -*• anal[X;Z]]];T 
[match[X;car[Yj3 -+ answer!X;cdr[Y];append!Z; 
cons!car[Y];NIL]]3;T “*■ answer[X;cdr[Y3;Z333 

anal[X;Z] * !null[Z] “► NIL;T "♦ append[cons[append[ques[X: 

car[z)3;cons[car[z3;NIL]3;NIL);anal[X;cdr[z]j33 

P 

ques[X;Y] « [nullCx] “*• NIL;caar[X] « quote[W] cons[carCY]; 

ques[cdr[Xj;cdr[Y]3];T ques[cdr[X];cdr[Y]33 

textanal[x3 processes the list of sentences: 

textanal[x] « [nullCx) "♦ NIL;T "♦ append[cons[order[parse! 

lex[car[X3;GRAMM1];GRAMM2]];NIL];textanal[edr[X3]1] 

oracle[X;Y] ha& as value the list of answers from sentences in 
~Y to the question X: oracle[X;Y] 

oracle[X;Yj » [fix[answer[order[edit[revert[parse[lex[x3; 

GRAMMi]jGRAMM2]333;textanal[Y];NIL)] 

The values of answer are of the form: 

(((IN THE CLASSROOM) ((THE TEACHER) (READS) (BOOKS) (IN THE 
CLASSROOM) (S)))) 

(YES (((THE TEACHER) (READS) (BOOKS) (IN THE CLASSROOM) (S)))) 

, (THE ORACLE SAYS NO) r ( 


i ' V 


I 



The function fix , with its subfunctions unpare n and unp has C 

the responsibility for rewriting these a 3 .: ~ 

(((IN THE CLASSROOM) (THE TEACHER READS BOOKS IN THE CLASSROOM))) 
(YES ((THE TEACHER READS BOOKS IN THE CLASSROOM))) 

(THE ORACLE SAYS NO) 

fix[x3 « [null[x] NIL;atom[car[x33 -*■ cons[car[x}; 

fix[cdr[x333;T cons[unparen'[car[x]3;flx[cdr[X]]3 ] 

unparen[x] ■ [null[X] NILjT “+■ cons [unp [car [X]];unparen[cdr[x3]]] 

unp[x3 - [null[x] -► NILjatom[car[x33 -* X;car{X] - quote[(s)3 

unp[cdr[x3]jT ^ append{car[x3;unp[cdr[x3 3 3 ) 



. Conclusions 




Conclusions. 


To achieve our original goal of passing a six-year- 
old’s reading-comprehension test we would need, as men¬ 
tioned in the introduction, a complex syntax-recognizer, 
a logic machine and an encyclopedia. Syntax recognition 
by machine will not come tomorrow, but it is on its way. 
Yngve’s hypotheses (see Bibliography) seem both revealing 
of the true nature of language and especially appropriate 
for machine use. Theorem provers have been built, and 
perhaps a program could be written to simulate the logical 
processes of a child. The encyclopedia does not seem 
entirely out of reach, either. With lists of synonyms, 
and tabulation of objects into various categories, e.g. 
animate and inanimate, and perhaps just stockpiling infor¬ 
mation about every-day life, one might be able to recon¬ 
struct the world as a six-year old knows it, the context 
in which he passes his reading-comprehension tests. Cer¬ 
tainly programming languages of high sophistication, such 
as LISP, used here, or COMIT*, designed for convenient 
manipulation of symbolic elements, will be an essential 
part of such an undertaking. From here, it looks feasible 

w 


See Yngve, "A programming Language for Mechanical Transla 
tion", Mechanical Translation , Vol. 5# No. 1, July, 195° • 
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